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Abstract 


We pose and study a fundamental algorithmic problem which we term mixture selection^ 
arising as a building block in a number of game-theoretic applications: Given a function g 
from the n-dimensional hypercube to the bounded interval [—1,1], and an n x m matrix A 
with bounded entries, maximize g(Ax) over x in the m-dimensional simplex. This problem 
arises naturally when one seeks to design a lottery over items for sale in an auction, or craft 
the posterior beliefs for agents in a Bayesian game through the provision of information (a.k.a. 
signaling). 

We present an approximation algorithm for this problem when g simultaneously satisfies two 
“smoothness” properties: Lipschitz continuity with respect to the L°° norm, and noise stability. 
The latter notion, which we define and cater to our setting, controls the degree to which low- 
probability — and possibly correlated — errors in the inputs of g can impact its output. The 
approximation guarantee of our algorithm degrades gracefully as a function of the Lipschitz 
continuity and noise stability of g. In particular, when g is both 0(1)-Lipschitz continuous and 
0(l)-stable, we obtain an (additive) polynomial-time approximation scheme (PTAS) for mixture 
selection. We also show that neither assumption suffices by itself for an additive PTAS, and 
both assumptions together do not suffice for an additive fully polynomial-time approximation 
scheme (FPTAS). 

We apply our algorithm for mixture selection to a number of different game-theoretic ap¬ 
plications, focusing on problems from mechanism design and optimal signaling. In particular, 
we make progress on a number of open problems suggested in prior work by easily reducing 
them to mixture selection: we resolve an important special case of the small-menu lottery 
design problem posed by Dughmi, Han, and Nisan [DHN14] ; we resolve the problem of revenue- 
maximizing signaling in Bayesian second-price auctions posed by Emek et al. [EFG~*~12 and 
Miltersen and Sheffet |BMS12| ; we design a quasipolynomial-time approximation scheme for the 
optimal signaling problem in normal form games suggested by Dughmi |Dugl4| ; and we design 
an approximation algorithm for the optimal signaling problem in the voting model of Alonso 
and Camara |AC14| . 
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1 Introduction 


Lotteries, beliefs, mixed strategies — all are distributions arising as important objects in game 
theory. It is unsurprising, therefore, that algorithmic game theory is rife with algorithmic problems 
which — implicitly or explicitly — optimize over the space of distributions, or equivalently the 
simplex. In this paper, we identify a family of algorithmic problems over the simplex which arise 
over and over in game theory. We term problems in this class mixture selection, and examine their 
computational complexity. 

Given a function g from the solid n-dimensional hypercube to the bounded interval [—1,1], and 
an integer m, we define the m-dimensional mixture selection problem for g as follows. The input to 
this problem is an n x m matrix A with bounded entries, and the objective is to compute x E A^, 
maximizing g{Ax). It is natural to expect that the computational complexity of mixture selection 
depends crucially on the “complexity” of the function g. We therefore identify two “smoothness” 
parameters of the function g which control the extent to which mixture selection is tractable, and 
derive a simple approximation algorithm with guarantees degrading gracefully in those parameters. 
Moreover, we present evidence — in the form of hardness results — that smoothness in both senses 
is necessary for the kind of general results we obtain. 

The hrst smoothness quantity is a familiar one, namely Lipschitz continuity in the metric. 
The second quantity, which we define and term noise stability, borrows ideas from related definitions 
of stability in other contexts (e.g. [KKL88t IMOOIO] ). though is importantly different. Informally, 
a function g from the solid n-dimensional hypercube to the real numbers is ^ -noise stable (or /3- 
stable for short) if the random corruption of an a-fraction of the n inputs to g, with no individual 
input disproportionately likely to be corrupted, does not decrease the output of g by more than 
ajd. We note that a Fourier-analytic notion of stability is closely-related to ours — we elaborate 
on this connection in Section [9l 

This paper lays out a framework for tackling mixture selection problems, and presents a number 
of applications in mechanism design and optimal signaling in games. Notably, we find that we 
resolve or make progress on a number of known open problems, and some new ones, using our 
framework. 

Our Results 

Our results for mixture selection can be viewed as generalizing the main insights of Lipton et 
ah [LMMOS] . First, we show that when g is noise stable and Lipschitz continuous, and x E 
is arbitrary, there is a sparse vector x for which g{Ax) is not much smaller than g(Ax). The 
proof of this fact proceeds by sampling from x and letting x be the empirical distribution, as in 
[LMM03] . However, when g is sufficiently noise stable and Lipschitz continuous, we obtain a better 
tradeoff between the number of samples required and the error introduced into the objective than 
does [LMMOSj . and this is crucial for our applications. Our analysis bounds the expected difference 
between g{Ax) and g{Ax) as the sum of two terms: The hrst term represents the error in the output 
of g caused by the low-probability “large errors” in its n inputs, and the second term represents the 
error in the output of g introduced by the higher-probability “small errors” in its n inputs. The 
hrst term is bounded using noise stability, and the second is bounded using Lipschitz continuity. 

Second, we instantiate the above insight algorithmically, as does [LMM03j . Specihcally, our 
algorithm enumerates vectors x of the desired sparsity in order to hnd an approximately optimal 
solution to our mixture selection problem. We note that our guarantees are all parametrized by 
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the Lipschitz continuity c and the noise stability 13 of the function g. Most notably, we obtain an 
additive polynomial-time approximation scheme (PTAS) whenever both f3 and c are constants. 

Third, we rule out certain natural extensions of our results assuming well-believed complexity- 
theoretic conjectures. We show that neither Lipschitz continuity nor noise stability alone suffices 
for an additive PTAS for mixture selection, and both together do not suffice for an additive fully 
polynomial-time approximation scheme (FPTAS). For a function which is 0(l)-stable yet 0(1)- 
Lipschitz continuous only in the metric, we show approximation hardness by a reduction from 
the NP-hard maximum independent set problem. For a function which is 0(1)-Lipschitz in L°° yet 
not 0(l)-stable, we show approximation hardness by a reduction from the planted clique problem. 
Finally, for a function which is both 0(1)-Lipschitz in L°° and 0(l)-stable, we rule out an additive 
FPTAS via a reduction from the maximum independent set problem. 

Despite the simplicity of our framework, we find that it has powerful implications for problems 
in mechanism design and optimal signaling in games. We feature four natural applications in this 
paper, three of which resolve or partially resolve outstanding open problems from prior work: 

1. Lottery design: Dughmi, Han, and Nisan |DHN14j examined one of the most basic problems 

in mechanism design: that of designing the revenue-maximizing multi-item auction for a single 
unit-demand buyer with valuation represented implicitly via a sampling oracle. They reduced 
this problem to a regularized variant of itself, namely optimally designing a small number 
of lottery-price pairs — a small menu — from which the buyer is allowed to choose. We 
apply our framework to resolve the special case of this problem with a single lottery — i.e., a 
menu of size 1. This follows from the Lipschitz continuity and noise stability of the function 
^Qottery)(^) . /[i- > p]} for an arbitrary weight vector w G A„, where I[£] 

is the indicator function for the event £. 

2. Revenue-maximizing signaling in probabilistic second-price auctions: Emek et 
al. [EFG^l^ and Miltersen and Sheffet |BMS12j considered signaling in the context of a 
probabilistic second-price auction. In particular, the attributes of the item for sale are un¬ 
known, and the auctioneer must decide what information to reveal in order to maximize his 
revenue in this auction. This is particularly relevant in advertising auctions, where items 
are impressions associated with demographics that are a-priori unknown to the advertisers 
bidding in the auction. Whereas both papers presented a polynomial-time algorithm for this 
problem when bidder types are fixed, the general problem was shown to be NP-hard and its 
approximability was left largely open. Using our framework, a PTAS for the general problem 
follows easily. We use the fact that the function max2, which simply returns the second 
largest entry of a vector, is Lipschitz continuous and noise stable. 

3. Persuasion in voting: Alonso and Camara [AC14| examine a simple election for selecting 
a binary outcome — say whether a ballot measure is passed — when voters are not fully 
informed of the consequences of the measure, and hence of their utilities. Each voter casts a 
Yes/No vote, and the measure passes if the fraction of Yes votes exceeds a certain pre-specified 
threshold. A principal — say a moderator of a political debate — can determine the protocol 
— or signaling scheme — through which information regarding the measure is gathered and 
shared with voters. We consider a principal concerned with maximizing the probability of 
the measure passing. |AC14] characterize the optimal signaling scheme and a number of its 
properties, though stop short of deriving an algorithm for optimal signaling. We design a 


3 












multi-criteria PTAS for this problem using our framework. Along the way, we also design a 
bi-criteria PTAS for the related problem of maximizing the expected number of Yes votes in 
the election. For both results, we use the fact that the function ^{™te-sum)= ^| {i : > 0} | 

is noise stable and Lipschitz continuous in a bi-criteria sense. 

4. Optimal signaling in normal form games: Dughmi |Dugl4| examined the problem of 
optimal signaling in abstract normal form games, and ruled out an FPTAS even for two- 
player zero-sum games. The possibility for a PTAS for two-player zero-sum games, and a 
QPTAS for general games with a constant number of players, were left open. We show that 
a bi-criteria QPTAS for normal-form games with a constant number of players follows from 
our framework, and applies to a large and natural class of objective functions. We use the 
fact that every function is 0(n)-stable, and the fact that the function measuring the quality 
of equilibria satisfies a bi-criteria notion of Lipschitz continuity which we define. 

Additional Discussion of Related Work 

As previously described, our framework generalizes and refines the main insight of |LMM03] . The 
recent work of Barman |Barl5| is also similar in spirit; in particular, the approximate variant of 
Caratheodory’s theorem employed in that paper can be viewed as a mixture selection problem with 
g{Ax) = —||Ax —Ax*||p for a fixed vector x* and normp > 2. Even though this function g is neither 
Lipschitz continuous in L°° nor noise stable. Barman exhibits a PTAS under the assumption that 
the columns of A have small (i.e. constant) p-norm. 


2 Framework 


For a function g : [—1,1]*^ ^ [—1,1] and a positive integer m, we define the following optimization 
problem which we term m-dimensional mixture selection for g: given an n x m matrix A with entries 
in [—1,1], find x in the m-dimensional simplex Am maximizing f{x) := g{Ax). In this section, we 
present our notion of noise stability, and derive approximation algorithms for this problem when the 
function g is simultaneously noise stable and Lipschitz continuous with respect to the metric. 
Moreover, we show that neither requirement alone suffices for our results. 

Our approximation guarantees will be additive — i.e., an e-approximation algorithm for mixture 
selection outputs x E with f{x) > maXygA^^ f{y) — e- To illustrate our techniques, we use the 
following function : [—1,1]"- [—1,1], which averages all but the top and bottom quartiles 

of its inputs, as a running example. 




1 

[Sn/d] — [n/4j 


[ 371/41 

i=[n/4j-|-l 


where t[j] denote the i^'^ largest entry of t. Throughout the paper, we use ti to denote the entry 
of t, and use to denote the i^^ largest entry of t. 

Though we present our framework for functions g : [—I,!]” —>■ [—1,1], we define mixture 
selection similarly for functions g : [0,1]”' —>■ [0,1]. The two definitions are equivalent up to 
normalization, and it is easy to verify that all our results and bounds for mixture selection carry 
through unchanged to either definition. 
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Our main result applies to functions g which are both noise stable and Lipschitz continuous 
with respect to the metric. We now formalize these two conditions. 


Lipschitz Continuity 

A function g : [—1,1]*^ ^ [“1)1] is c-Lipschitz continuous in L°° — or c-Lipschitz for short — if 
and only if for all t,t' in the domain of g, \g{t) — g{t')\ < c\\t — To illustrate, our example 

function is 1-Lipschitz. We note that Lipschitz continuity in L°° is a stronger assumption 

than in any other norm. 


Noise Stability 

Our notion of noise stability captures the following desirable property of a function g : [—1,1]" ^ 
[—1,1]; if a random process corrupts (i.e., modifies arbitrarily) some of the inputs to g, with no 
individual input disproportionately likely to be corrupted, then the output of g does not decrease 
by much in expectation. Such random corruption patterns are captured by our notion of a light 
distribution over subsets of [n], dehned below. 

Definition 2.1 (Light Distribution). Let V be a distribution supported on subsets of [n]. For 
a G (0,1], we say V is a-Light if and only if the following holds for all i G [n]: FrR^x>{i G i?] < a. 

In other words, a light distribution bounds the marginal probability of any individual element 

of [n]. When corrupted inputs follow a light distribution, no individual input is too likely to be 

corrupted. However, we note that our notion of light distribution allows arbitrary correlations 

between the corruption events of various inputs. We define a noise stable function as one which 

is robust, in an average sense, to corrupting a subset R of its n inputs when R follows a light 

distribution V. Our notion of robustness is one-sided: we only require that our function’s output 

not decrease substantially in expectation. This one-sided guarantee suffices for all our applications, 

and is necessitated by some. We note that the light distribution V, as well as the (corrupted) 

inputs, are chosen adversarially. We make use of the following notation in our dehnition: Given 

vectors t,t' G [—1,1]"^ and a set i? C [n], we say ss t if L = t[ for all i ^ R. In other words, if 

R 

t' ~ t, then t' is a result of corrupting the entries of t corresponding to R. 

R 

Definition 2.2 (Noise Stability). Given a function g : [—1,1]” ^ [—1,1] and a real number /3 > 0, 
we say g is fd-stable if and only if the following holds for all t G [—1, Ij”, a G (0,1], and a-Light 
distributions V over subsets of [n] .• 


E 

R'^R 


mm{g{t’) :t' ^t} 
H 


> g{t) - 0/3. 


To illustrate this definition, we show that our example function is 4-stable. To see this, 

observe that changing k entries of the input to can decrease its output by at most ^. When 

R is drawn from an a-Light distribution and t is an arbitrary input, 4-stabihty therefore follows 
from the linearity of expectations: 


E 

R'^R 


mm{g^^^^\t') : t' ^ t} 
R 


> E 

R'^R 




m' 

n 


>^(mid)(^) 
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We note that every function g : [—1,1]” —)• [—1,1] is 2n-stable, which follows from the union bound. 

As a useful building block for proving some of our functions stable, we show that stable functions 
can be combined to yield other stable functions if composed with a convex, nondecreasing, and 
Lipschitz continuous function. 

Proposition 2.3. Fix /3,c > 0, and let gi,g 2 , ■ ■ ■ ,gk ■ [—1,1]*^ —^ [—1,1] be fd-stable functions. For 
every convex function h : [—1,1]^ —)■ [—1,1] which is nondecreasing in each of its arguments and 
c-Lipschitz continuous in L°°, the function g{t) := h{gi{t),... ,gk{t)) is {I3c)-stable. 


Proof. For all t G [—1,1]'^ and all a-Light distributions F, 


E [min(7(t')]= E \mmh(qi(t'), 

R R 

■ ■■,gk{t'))] 


> E [/i(ming'i(t'), 

R 

■ ■ • 

(Since h is nondecreasing) 

> h{ F [min O'! (t')], 

R 

..., E [min(/fc(t')]) 

tir^U 

R 

(Jensen’s inequality) 

> h{gi{i) - al3,.. .,gk{t) - ajd) 

(Stability of each gi) 

> h{gi{f),... ,gk{t)) 
= g{t) - afc. 

— aftc 

(Lipschitz continuity of h) 


As a consequence of the above proposition, a convex combination of /3-stable functions is /3-stable, 
and the point-wise maximum of /3-stable functions is /3-stable. 

Consequences of Noise Stability and Lipschitz Continuity 

We now state the two main results of our framework. Both results apply to functions g : [—1, Ij” ^ 
[—1,1] which are simultaneously Lipschitz continuous and noise stable, and nx m matrices A with 
entries in [—1,1]. Given a vector x G and integer s > 0, we view x as a probability distribution 
over [m], and use the random variable x G Am to denote the empirical distribution of s i.i.d. 
samples from x. Formally, x = -Y2i=i^ki: where A:i,...,/cs G [m] are drawn i.i.d. according to 
X, and Cj G Am denotes the standard basis vector. Since x is the average of s standard basis 
vectors, we say it is s-uniform. 

Definition 2.4. We refer to a distribution y G Am as s-uniform if and only if it is the average of 
a multiset of s standard basis vectors in m-dimensional space. 

Our first result shows that when the number of samples s is chosen as a suitable function of 
the Lipschitz continuity and noise stability parameters, g{Ax) is not much smaller than g{Ax) in 
expectation over x. At a high level, we bound this difference as a sum of two error terms; one 
accounts for the effect of low-probability large errors in the inputs t = Ax to g, and the other 
accounts for effect of higher-probability small errors in the inputs t. The former error term is 
bounded using noise stability, and the latter error term is bounded using Lipschitz continuity. 

Theorem 2.5. Let g : [—1,1]"^ ^ [“1)1] be fd-stable and c-Lipschitz in L°°, let A be an n x m 
matrix with entries in [—1,1], let a,5 > 0, and let s > 21n(^)/(5^ be an integer. Fix a vector 
x G Am, and let the random variable x denote the empirical distribution of s i.i.d. samples from 
probability distribution x. The following then holds: F[g{Ax))\ > g{Ax) — aft — c5. 
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Proof. Denote t = Ax and t = Ax. Note that t is a random variable. Also note that ti and 

ti can be viewed as the mean and empirical mean, respectively, of a distribution supported on 

... ,Ai^rn £ [—1, !]• We say the entry of t is approximately preserved if \ti — ti\ < 5, and 

we say it is corrupted otherwise. Let R C [n] denote the set of corrupted entries. Hoeffding’s 

inequality, and our choice of the number of samples s, imply that R follows an a-Light distribution. 

Let t' be such that (1) t[ = ti for i ^ R, and (2) t'^ = ti otherwise. Observe that t' ~ t, and 

R 

||t' — t||oo ^ We can now bound the expected difference between g{t) and g{t) as a sum of the 
error introduced by corrupted entries and the error introduced by the approximately preserved 
entries of t: 


9{t) - E[g(t)] = E[c/(t) - g{t')] + E[g{t') - g{t)] < a(d + cS. ^ 

Notice that if we fix the desired approximation error e, the minimum required number of 
samples s in Theorem 12.51 to guarantee that E[ 5 ((Ax))] > g{Ax) — e is obtained by minimizing 
r21n(f)/i21 over a, J > 0 satisfying ajd + 5c < e. Therefore, the required number of samples 
depends only on the error term e, the noise stability parameter /3, and the Lipschitz continuity 
parameter c; in particular, it is independent of n and m. 

As a corollary of Theorem 12.51 we derive the following algorithmic result. 

Theorem 2.6. Let g : [—1,1]”' jd-stahle and c-Lipschitz, and let m > 0 be an inte¬ 

ger. For every 5,a > 0, the m-dimensional mixture selection problem for g admits an {a/3 + c5)- 
approximation algorithm in the additive sense, with runtime n • - T, where T denotes 

the time needed to evaluate g on a single input. 

Proof. Let s > 21n(2/a)/5^ be an integer. Our algorithm simply enumerates all s-uniform distri¬ 
butions, and outputs the one maximizing g{Ax). This takes time n ■ • T. The approximation 

guarantee follows from Theorem 12.51 and the probabilistic method. ■ 

As a consequence of Theorem l2.6l the mixture selection problem for admits a polynomial¬ 

time approximation scheme (PTAS) in the additive sense. The same holds for every function g 
which is 0(l)-stable and 0(1)-Lipschitz continuous. Specifically, by setting ct = ^ and <5 = ^, an 

e-approximation algorithm runs in time n • -T. Interestingly, neither noise stability 

nor Lipschitz continuity alone suffices for such a PTAS, as we argue in the next subsection. 

The Necessity of Both Noise Stability and Lipschitz Continuity 

We now present evidence that both our assumptions — Noise stability and Lipschitz continuity — 
appear necessary for general positive results along the lines of those in Theorem 12.61 

1. Stability alone is not sufficient. In Section [8Tl we define a function : [0,1]” —>■ [0,1] 

which is 1-stable. Furthermore, is 0(1)-Lipschitz with respect to the metric, which 

is a weaker property than Lipschitz continuity with respect to L°°. We show in Theorem 18.41 
that there is a polynomial-time reduction from the maximum independent set problem on 
n-node graphs to the n-dimensional mixture selection for g'(®^°P®). Moreover, the reduction 
precludes a polynomial-time e-approximation algorithm in the additive sense for some con¬ 
stant e > 0. 
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2. Lipschitz continuity alone is not sufficient. One might hope to prove NP-hardness of mixture 
selection in the absence of stability. However, we are out of luck in this regard: since every 
function g : [—1,1]” ^ is 2n-stable, Theorem 12.61 implies a quasipolynomial-time 

approximation scheme in the additive sense whenever g is 0(1)-Lipschitz. Nevertheless, we 
prove hardness of approximation assuming the planted clique conjecture ( [Jer 92] and |Kuc95| b 

More specifically, in Section 18.21 we exhibit a reduction from the planted fe-clique problem 
to mixture selection for the 3-Lipschitz function (t) = t[^ — -|- t[n]- When k = 

uj{log^ n) and A is the adjacency matrix of an n-node undirected graph G, we show that 
maxa; {Ax) ~ 1 with high probability if G contains a /c-clique, and max^, ~ 

^ with high probability if G is the Erdos-Renyi random graph G{n, ^). 

A Bi-criteria Extension of the Framework 

We have already showed that in the absence of Lipschitz continuity, one can not hope for a PTAS 
in general. Motivated by two of our applications, namely Optimal signaling in normal form games 
and Persuasion in voting, we extend our framework to the design of approximation algorithms 
for mixture selection with a bi-criteria guarantee when the function in question is stable but not 
Lipschitz continuous. We first define a (<5, /?)-relaxation of a function. 

Definition 2.7. Given two functions g, h : [—1,1]” ^ [—1,1] and parameters 5,p > 0, we say h is 
a {5, p)-relaxation of g if for all ti,t 2 G [—1,1]” with ||ti — ^ 2 ] loo < d, h{t 2 ) > g{ti) — p. 

In lieu of the Lipschitz continuity condition, we prove our bounds for a relaxation of the function. 

Theorem 2.8. Let g : [—1,1]” ^ [~1)1] de ffistable, let A be an n x m matrix with entries in 
[—1,1], let a > t) and S, p > 0, and let s >2 ln(|-)/(j^ be an integer. Fix a vector x G A^, and let the 
random variable x denote the empirical distribution of s i.i.d. samples from probability distribution 
X. The following then holds for any {6, p)-relaxation h of g, 

E[h{Ax))] > g{Ax) — a(3 — p. 

Proof. Because the proof is almost identical to the proof of Theorem 12.51 we just mention the 
necessary modifications. Again, let t = Ax, let t = Ax, let R C [n] denote the set of corrupted 
inputs, and let t' be such that t{ = for i G i? and t{ = L otherwise. Then 

g{t) - E[h{t)] = E[g{t) - g{t')] + E[g{t') - h{t)] 

< aft E[ 5 r(f') — h{t)] 

< aft p, 

where the first inequality follows by noise stability of g, and the last inequality follows from the 
fact that /i is a {5, /oj-relaxation of g. ■ 

Having replaced Theorem 12.51 bv Theorem 12.81 a similar computational result as Theorem 12.61 
can be inferred in the bi-criteria sense. 






3 Lottery Design for Revenue Maximization 


To illustrate the utility of our framework, we start with a simple but basic open problem in Bayesian 
mechanism design posed by Dughmi, Han, and Nisan [DHN14j . An instance of the lottery design 
problem is given by a valuation matrix A G [0,and n non-negative weights tci,..., Wn with 
= 1. Here n denotes the number of buyer types, m denotes the number of items, and 
w represents a probability distribution over types. Each 0 < Aij < 1 is the value of item j to a 
buyer of type i. The goal is to design a single lottery-price pair {x,p), with x G Am and p > 0, so 
that the expected revenue of the auction which offers the lottery x over items at price p to a buyer 
with type drawn according to w is maximized. We assume the buyer is risk neutral, and therefore 
accepts the offer precisely if his type i satisfies AiX > p. Consequently, our goal is to choose {x,p) 
maximizing p ■ ^ P])^ where I[£] denotes the indicator function for the event £. 

The lottery design problem is closely related to the general unit-demand single-buyer mechanism 

design problem considered in |DHN14j . where the buyer’s type is drawn from a common knowledge 

prior distribution B given by a sampling oracle, and the buyer is to be presented with a menu 

consisting of several lottery-price pairs from which to choose. [DHN14j frame this mechanism 

design problem as a computational task of “learning” a good mechanism by sampling from B, and 

use the size of the menu as a regularization constraint in order to prevent over-fitting the mechanism 

to the sampled data. The problem of maximizing the expected revenue by using a menu of at most 

a given size is the main algorithmic question in [DHN14j . and its computational complexity is 

left largely open. When constrained to a menu with a single lottery, the goal is to choose (x,p) 

maximizing the expected revenue p ■ Pr \ax > p\. 

a^B 

Using our mixture selection framework, we first give an additive PTAS for the lottery design 
problem when the value distribution B is given explicitly by the matrix A and weights w as described 
above. We then extend our result to cases in which B can only be accessed through sampling, 
using fairly standard uniform convergence arguments. In Section [8.31 (Theorem [8^ . we rule out 
an additive FPTAS for this problem, and in doing so provide complexity-theoretic evidence that 
our PTAS — for both lottery design in particular and mixture selection for stable and Lipschitz- 
continuous functions more generally — is essentially the best we can hope for. 

3.1 Lottery Design in the Explicit Input Model 

Given the number of buyer types n and weights w G A„, the lottery design problem is simply 
mixture selection for the function : [0, !]”■ ^ [0,1], defined below. 

n 

^aottery)(^) = niax{p • • I[U > p])} (3.1) 

As the first step to applying our framework, we show that is noise stable. The high 

level idea is the following: if a subset of the inputs to is corrupted, then in the worst case 

each such input exceeded the price p before corruption but not after corruption. This reduces the 
output of by at most the total weight of corrupted inputs. When corrupted inputs are 

chosen according to an a-Light distribution, their expected total weight is bounded by a. 

Lemma 3.1. The function is 1-stable. 
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Proof. Let t € [0,1]” be an arbitrary input to . When t' is obtained from t by corrupting the 

entries corresponding to i? C [n], an event we denote by t' ^ t, it is easy to see that > 


^Pottery)(^) where wi^R) = denotes the total weight of corrupted entries. Moreover, 

when is a random variable drawn from an a-Light distribution T>, we can bound the expected 
loss: E[r(;(ii)] = X]r=i ^ R] ' Wi < awi = a. It follows that is 1-stable. 


E 


IX 


> s£°"”’>(t) - E |u.(fl)| 


Next, we prove Lipschitz continuity. The high-level idea is the following: if all inputs to 
decrease by 5, then we need only decrease the price p from expression (j3.1j) by 5. The (weighted) 
fraction of inputs exceeding the price is at least the same as before. 

Lemma 3.2. The function is 1-Lipschitz continuous. 

Proof. Consider t,t' G [0,1]" with ||t' — t||oo < <5- We prove that gw°^^^'^^\t') > gw°^^^^^\t) — 5, and 
by symmetry it follows that {t) > {t') — 6. Let p be the optimal price for t — i.e. the 

maximizer of expression (|3.ip — and define p' = max{0,p—<5}. Whenever L > p we have > p^ and 

therefore Y > pD > {wi ■ I[ti > p]). It follows that {t') > {t) — 6. ■ 

i=l i=l 

Combining Lemmas 13.II and 13.21 with Theorem 12.61 yields an additive PTAS for lottery design. 

Theorem 3.3. There is an additive PTAS for the lottery design problem when the valuation dis¬ 
tribution is given explicitly by a matrix A G [0,1]”'^™' and a weight vector w G A„. 


3.2 Lottery Design in the Sample Oracle Model 

So far, we have assumed that the buyer’s type distribution is given explicitly. We now show how to 

extend our results to the sample oracle model. Specifically, we assume the buyer’s type a G [0,1]™ 

is drawn from a distribution B given by a sampling oracle, and seek a randomized approximation 

scheme with runtime (and number of samples) polynomial in m for each desired approximation 

guarantee e. To simplify exposition we assume B has finite support, though our results hold more 

generally. As usual, our goal is to choose a lottery x G and a price p > 0 to maximize 

Revg(x,p) = p • Pr [ax > p]. 

a^B 

Theorem 3.4. There is an additive polynomial-time randomized approximation seheme (PRAS) 
for the lottery design problem in the sample oraele model. 

Proof. Recall that the PTAS in Theorem 13.31 optimizes over all s-uniform m-dimensional lotteries, 
where s depends only on the desired approximation guarantee e > 0, and is bounded by a polynomial 
in A In particular, s is independent of the number of types n, implying that the same approach of 
enumerating all s-uniform lotteries x would succeed in the sample oracle model were it not for our 
inability to evaluate maxp Revg(x,p) exactly. 

We overcome this difficulty by Monte Carlo sampling from B. Given n types oi,..., G [0,1]™ 
sampled from B and presented as the rows of a matrix A G [0, we run the PTAS from 
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Theorem 13.31 with approximation parameter e on the empirical distribution given by A and uniform 
weights Wi = ^ for all i. Taking n to be a suitable polynomial in m, and log(^) — where 7 > 0 
is a parameter — guarantees that |Revg(5:,p) — p • 7 ^ P]| ^ ^ simultaneously for all 

s-uniform lotteries x and prices p with probability at least 1 — 7 . This follows from standard tail 
bounds and the union bound, coupled with a uniform convergence argument over prices p € [ 0 , 1 ]. 
Therefore, with probability 1 — 7 our algorithm outputs a lottery-price pair whose expected revenue 
for a buyer drawn from B is within 0 (e) from the optimal. ■ 


4 Signaling 

In the next few sections, we consider a number of Bayesian games in which a key parameter 9, the 
state of nature, in part determines the payoff structure of the game. We use 0 to denote the set of 
all states of nature, and assume 0 G 0 is drawn from a common-knowledge prior distribution which 
we denote by A. In all our applications, we assume players a-priori know nothing about 0 other 
than its prior distribution A, and examine policies whereby a principal with access to the realized 
value of 9 may commit to a policy of revealing information to the players regarding 9. This is often 
referred to as signaling (see e.g. |EF0+12[ IRMS121 IDIBIdl |Dugl4| ). We restrict our attention to 
symmetric signaling schemes, in which the principal must reveal the same information to all players 
in the game. Thus, a symmetric signaling scheme is given by a set S of signals, and a (possibly 
randomized) map (p from states of nature 0 to signals S. The goal of the principal, who is privy to 
confidential state-of-nature information, is to boost her own objective by using a signaling scheme 
(p that optimally affects the outcome of the game. 

In this section, in addition to providing the technical background for signaling schemes, we use 
our framework to define an abstract signaling problem and characterize its approximation complex¬ 
ity. This abstract problem captures the essence of all signaling problems considered in this paper. 


4.1 Background: Signaling Schemes 

Let m = |0|. Abusing notation, we use p{9, a) to denote the probability of announcing signal a G T, 
conditioned on the state of nature is 0 € 0. It is well known l |KG09l Dugl4| ) that signaling schemes 
are in one-to-one correspondence with convex decompositions of the prior distribution A G A^,: 
Formally, a signaling scheme : 0 —)• S corresponds to the convex decomposition A = 
where (1) Va = Pi' 0 ..^©[(^( 0 ) = cr] = is the probability of announcing signal a, and 

( 2 ) Pa{9) = Pr 6 i...o 0 [ 0 | 7 >( 0 ) = a] = is the posterior belief distribution of 9 conditioned 

on signal a. The converse is also true: every convex decomposition of A G corresponds to a 
signaling scheme. Alternatively, the reader can view a signaling scheme ip as the m x |S| matrix of 
pairwise probabilities p{9,a) satisfying conditions (1) and (2) with respect to A G A^- 

Note that each posterior distribution p G Am defines a Bayesian game, and the principal’s 
utility depends on the outcome of the game. Given a suitable equilibrium concept and selection 
rule, we let / : A^ —> K denote the principal’s utility as a function of the posterior distribution p. 
For example, in an auction game f{p) may be the social welfare or principal’s revenue at the 
induced equilibrium, or any weighted combination of players’ utilities, or something else entirely. 
The principal’s objective as a function of the signaling scheme p can be mathematically expressed 
by F{p,X) /(f^t). 
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In this setup, the optimal choice of a signaling scheme is related to the concave envelope of the 
function / ( |KG09l [Dugl4| ) li| Specifically, such a signaling scheme achieves Yla ' filler) = /"'"(A). 
Thus, there exists a signaling scheme with m + 1 signals that maximizes the principal’s objective, 
by applying Caratheodory’s theorem to the hypograph of /. 

4.2 An Abstract Signaling Problem and its Polynomial-Time Approximation 

To connect to our mixture selection framework, we consider signaling problems in which the prin¬ 
cipal’s utility f{p) from a posterior distribution /r E can be written as g{Ap) for a function 
g : [—1,1]"' —7> [—1,1] and a matrix A E [—1,1]"^”^. As described in Section l4.ll a signaling 
scheme ip with signals S corresponds to a family of probability-posterior pairs {(t'o-, Mo-)}o-gE de¬ 
composing the prior A E into a convex combination of posterior distributions (one per signal): 
A = X^o-es The objective of our signaling problem is then 

= X] ’^<Tg{Ag,a)- 

(tES (tES 

We note that this signaling problem can alternatively be written as an (infinite-dimensional) 
linear program which searches over probability measures supported on A^ with expectation A. 
The separation oracle for the dual of this linear program is a mixture selection problem. Whereas 
we do not use this infinite-dimensional formulation nor its dual directly, we nevertheless show that 
the same conditions — noise stability and Lipschitz continuity — on the function g which lead to 
an approximation scheme for mixture selection also lead to a similar approximation scheme for our 
signaling problem with /(/r) = g{Ag). 

Lemma 4.1. If g is fl-stable and c-Lipschitz, then for any constants a,5 > 0, and for any integer 
s > 25~‘^ \n{2/a), there exists a signaling scheme p for which every posterior distribution is s- 
uniform, and F{p) > OPT — {a/3 + c6) where OPT denotes the value of the optimal signaling 
scheme. 

Proof. Let s > 25“^ ln(2/Q:), and let r E [m®] index all s-uniform posteriors, with fir denoting the 
r’th such posterior. For an arbitrary signaling scheme p = (S, f*o-)}(Tgs), we show that each 

posterior /Xg- can be decomposed into s-uniform posteriors without degrading the objective by more 
than a/3 + c5; more formally: 

1 . Pa can be expressed as a convex combination of s-uniform posteriors as follows. 

Pa = '^ Va^rPr with Va E A^* ■ (4.1) 

TE[m^] 


2. The value of objective function, i.e., g{Apa), is decreased by no more than a/3 + c6 through 
this decomposition, 

na,T ■ g{Apr) > g{Apa) - {a/3 + c5). (4.2) 

Tg[m®] 

^ is the point-wise lowest concave function h for which h{x) > f{x) for all x in the domain. Equivalently, the 
hypograph of is the convex hull of the hypograph of /. 
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The existence of such a decomposition follows from Theorem 12.51 Fix a, and let J1 G be the 
empirical distribution of s i.i.d. samples from distribution G A^- The vector J1 is itself a 
random variable supported on s-uniform posteriors, its expectation is and by Theorem 12.51 we 
have E[g{Ajl)] > — {a/S + c5). Therefore, by taking = Pr[/i = Jl-j-] for each r G [m®] we 

get the desired decomposition of 

The lemma follows by composing the decomposition (p with the decompositions of the posterior 
beliefs pa to yield a signaling scheme p with only s-uniform posteriors and F{p) > F{ip) — (a/3+C(5). 
Specifically, the signals of p are S x [m®], where signal (cr, r) has probability and induces 

the posterior Jlr- □ Using Equation (14.111 and ()4.2p . it is easy to verify that this describes a valid 
signaling scheme with F{(p) > F{ip) — {a/3 + c5). ■ 

Lemma 14.11 permits us to restrict attention to s-uniform posteriors without much loss in our 
objective. Since there are only m® such posteriors, a simple linear program with m® variables 
computes an approximately optimal signaling scheme. 

Theorem 4.2 (Polynomial-Time Signaling). If g is j3-stable and c-Lipschitz, then for any constant 
a, (5 > 0, there exists a deterministic algorithm that constructs a signaling scheme with objective 
value at least OPT — {a/S + cd), where OPT is the value of the optimal signaling scheme. Moreover, 
the algorithm runs in time poly(m'^ '"(1/“)) ■ n-T, where T denotes the time needed to evaluate g 
on a single input. 

Proof. Let s be an integer with s > (25“^ ln(2/a)), denote M = m®, and let {/ii,-- - ,^m} be 
the set of all s-uniform posteriors. Lemma l4.ll shows that restricting to s-uniform posteriors only 
introduces an a/3 + c6 additive loss in the objective. Thus it suffices to compute the optimal 
signaling scheme supported only on s-uniform posteriors. This can be done using the following 
linear program: 

maximize EiG[M] 

subject to EjG[M] (4-3) 

v G Am 

Note p,j is the j’th s-uniform posterior — the only variables in this LP are izi,..., vm- ■ 

Our proofs can be adapted to obtain a bi-criteria guarantee in the absence of Lipschitz conti¬ 
nuity, as in Section [2j The following Theorem follows easily, and we omit the details. 

Theorem 4.3 (Polynomial-Time Signaling (Bi-criteria)). Let g,h : [—1,1]” ^ such 

that g is /3-stable and h is a {6, p)-relaxation of g, and let a > 0 be a parameter. There exists a 
deterministic algorithm which, when given as input a matrix A G [—1, 1 ]”^™- and a prior distribution 
A G Am, constructs a signaling scheme (p = {{ly^, pLa)}a&T, such that 

Vah/Apa) > OPT — a/3 — p, 

o-es 

where OPT is the maximizer of F{p*) = X^o-es* signaling schemes p* = {(t'*, /^CT)}o-es* • 

Moreover, the algorithm runs in time poly(m'^ ^i°(i/“)) .n-T, where T denotes the time needed to 
evaluate h on a single input. 

^Note, however, that we can also “merge” all signals with the same posterior pr without loss. 
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Remarks We note that our proof suggests an extension of the result in Theorem 14.21 to cases 
in which / is given by a “black box” oracle, so long as we are promised that it is of the form 
/(/r) = g{A^). In this model the runtime of our algorithm does not depend on n, but instead 
depends on the cost of querying /. We also point out that that even though we precompute the 
quality of all posteriors, we can guarantee that our output signaling scheme uses at most m +1 
signals; this is because LP (j4.,Sh has only m+1 constraints, and therefore admits an optimal solution 
where at most m + 1 variables are non-zero. 

5 Signaling in Probabilistic Second-Price Auctions 

We examine signaling in probabilistic second-price auctions, as considered by Emek et ah [EFG^12 
and Miltersen and Sheffet [BMS12] . In this setting, the item being auctioned is probabilistic, and 
the instantiation of the item is known to the auctioneer but not to the bidders. The auctioneer 
commits to a signaling scheme for (partially) revealing information about the item for sale before 
subsequently running a second-price auction. We consider a probabilistic second-price auction 
described by the following parameters: 

- An integer n denoting the number of bidders. We index the players by the set [n] = {1,..., n}. 

- An integer m denoting the number of states of nature. We index states of nature by the set 
0 = {!,..., m}. Each 0 € 0 represents a possible instantiation of the item being sold. 

- A common-knowledge prior distribution A E on the states of nature. 

- A common-knowledge prior distribution V on valuation matrices V E [0,1]”^”^, given either 
explicitly or as a “black-box” sampling oracle. For a valuation matrix V, entry Vij denotes 
the value of player i for the item corresponding to state of nature j. 

The game being played is the following: (a) The auctioneer first commits to a signaling scheme 
^5 : 0 —>■ S; (b) A state of nature 0 E 0 is drawn according to A and revealed to the auctioneer 
but not the bidders; (c) The auctioneer reveals a public signal a ~ (p{9) to all the bidders; (d) A 
valuation matrix V E [0,1]"'^™’ is drawn according to P, and each player i learns his value Vij for 
each potential item j; (e) Finally, a second-price auction for the item is run. 

As an example, consider an auction for an umbrella: the state of nature 6 can be the weather 
tomorrow, which determines the utility Vi^$ of an umbrella to player i. We assume that A and P are 
independent. We also emphasize that a bidder knows nothing about 9 other than its distribution A 
and the public signal a, and the auctioneer knows nothing about V besides its distribution P prior 
to running the auction. 

We adopt the (unique) dominant-strategy truth-telling equilibrium as our solution concept. 
Specifically, given a signaling scheme (/? : 0 —)■ S and a signal cr E S, in the subgame corresponding 
to cr it is a dominant strategy for player i to bid 'EQr.,\\yiQ\ip{9) = cr] — his posterior expected 
value for the item conditioned on the received signal a. Therefore the item goes to the player with 
maximum posterior expected value, at a price equal to the second-highest posterior expected value. 

The algorithmic problem we consider is the one faced by the auctioneer in step (a) — namely 
computing an optimal signaling scheme — assuming the auctioneer looks to maximize expected 
revenue. It was shown in |EFG'*~12l IBMS12| that polynomial-time algorithms exist for several 
special cases of this problem. However, the general problem was shown to be NP-hard even with 
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3 bidders — specifically, no additive FPTAS exists unless P = NP. In this section, we resolve the 
approximation complexity of this basic signaling problem by giving an additive PTAS. We note 
that variations of this problem were considered in [GNSOTl IGD13] , with different constraints on 
the signaling scheme — the results in these works are not directly relevant to our model. 


5.1 Revenue is Stable 

Given a signaling scheme (p expressed as a decomposition {va-, Pa}(j&T, of the prior distribution A, 
we can express the auctioneer’s expected revenue as 

E Va E max2(V/io-), 

v~x> \ 

ctGE 

where the function max 2 returns the second largest entry of a given vector, i.e. max 2 (t) = tj 2 ]- To 
apply our main theorem, we need to show that the revenue in a subgame with posterior distribution 
/i E Am — namely max2(V/r) — can be written in the form g{Wfi) for a matrix W. To 

facilitate our discussion we assume that the valuation distribution V has finite support size C, 
though this is without loss of generality. Imagine we form a large matrix W by stacking matrices 
in the support of V on top of each other. Formally, W = [V^, Vj, • • • , where Vi is the Ah 
matrix in the support of P. When matrix Vi is drawn from P, we take the second-highest bid from 
the rows of W corresponding to Vi (rows (i — 1) • n -|- 1 to i • n, where n is the number players). For 
S C [nC] and t E [0, let max 25 (t) denote the second-highest value among entries of t indexed 
by S. Then we can write the auctioneer’s expected revenue as 

= E max25(v)(IT/x) 

L) 

where S'CP) is the set of rows in W corresponding to V. 

Lemma 5.1 (Smooth and Stable Revenue). The function = Ev.^d max 25 (y)(t) is 1- 

Lipschitz and 2-stable. 

Proof. Because max 25 is 1-Lipschitz for a fixed set of indices S, it follows that which is a 

convex combination of these 1-Lipschitz functions, is also 1-Lipschitz. 

To show that is stable, we first show that the function max2 : [0,1]” —>■ [0,1] is stable. 

Given t E [0,1]" and a random set R C [n] drawn from an a-light distribution P, the union bound 
implies that R includes neither of the two largest entries of t with probability at least 1 — 2a. In 
this case, the value of max2 is not affected by corruption of the entries indexed by R. Hence 


E 

R'^'D 


min{max2(t') : t' ^t} 


> (1 — 2a) • max2(t) -|- 2a • 0 > max2(t) — 2a. 


Therefore max2 is 2-stable, which implies that max 25 : [0,1]"'*^ —>■ [0,1] is also 2-stable for any 
fixed set of indices S. The function is a convex combination of functions of the form max 25 , 
and is therefore also 2-stable by Proposition 12.31 ■ 


Theorem 5.2. The revenue-maximizing signaling problem in probabilistic second-price auctions 
admits an additive PTAS when the valuation distribution is given explicitly, and an additive PR AS 
when the valuation distribution is given by a sampling oracle. 
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Proof. Lemma 15.11 shows that the function jg 2-stable and 1-Lipschitz. If the valuation 

distribution P is explicitly given with support size C, the function evaluated in 

poly(n, m, C) time. Then for any e > 0, it follows from Theorem 14.21 bv setting a = e/4 and 

5 = e /2 that there is a deterministic algorithm that computes a signaling scheme with expected 

revenue OPT — e, in time poly(n, C). 

If P is given via a sampling oracle, standard tail bounds and the union bound imply that 
C = 0((slogm + log( 7 “^))/e^) samples from P suffice to estimate to within 0(e) the revenue 
associated with every s-uniform posterior in A^, with success probability 1 — 7 . Since revenue 
is 0(l)-stable and 0(1)-Lipschitz, Lemma l4.ll implies that we can restrict attention to signaling 
schemes with s-uniform posteriors for s = poly(^). Proceeding as in Theorem l4.2[ using the revenue 
estimates from Monte-Carlo sampling in lieu of exact values, we can construct a signaling scheme 
with revenue OPT — e in time poly(n, log(i)), with success probability 1 — 7 . ■ 

6 Persuading Voters 

In this section, we apply our mixture selection framework to natural signaling problems encountered 
in the context of social choice, as introduced by Alonso and Camara [AC 14] . Consider an election 
with two possible outcomes, ‘Yes’ and ‘No’. For example, voters may need to choose whether to 
adopt a new law or social policy; board members of a company may need to decide whether to 
invest in a new project; and members of a jury must decide whether a defendant is declared guilty or 
not guilty. As in |AC14| . we focus on the scenario in which voters have uncertainty regarding their 
utilities for the two possible outcomes (e.g., the risks and rewards of the new project). Specifically, 
voters’ utilities are parameterized by an a-priori unknown state of nature 9 drawn from a common- 
knowledge prior distribution. We adopt the perspective of a principal with access to the realization 
of 6, and looking to influence the outcome of the election by signaling. 

Formally, we consider a voting setting with n voters and m states of nature. We index the 
voters by the set [n] = {1,... ,n}, and states of nature by the set 0 = {1,... ,m}. We assume 
voters’ preferences are given by a matrix U E [—1, where Uij denotes voter z’s utility in 

the event of a ‘Yes’ outcome in state of nature j. Without loss of generality, we assume utilities 
are normalized so that each voter’s utility for a ‘No’ outcome is 0 in each state of nature. A 
voter i who believes that the state of nature follows a distribution fi E Am has expected utility 
u{i, n) = X]jG 0 ^ ‘Yes’ outcome. In most voting systems with a binary outcome, including 

for example threshold voting rules, it is a dominant strategy to vote ‘Yes’ if the utility u{i, /r) is 
at least 0 and ‘No’ otherwise. For our approximation algorithms, we also allow implementation 
in approximate dominant strategies — i.e., we sometimes assume a voter votes ‘Yes’ if his utility 
u{i,n) is at least —6 for a small parameter dH We assume that the state of nature 0 E 0 is drawn 
from a common prior A E A^, and a principal with access to 0 reveals a public signal a prior to 
voters casting their votes. As usual, we adopt the perspective of a principal looking to commit to 
a signaling scheme y? : 0 ^ S, for some set of signals S. 

Alonso and Camara [AC 14] consider a principal interested in maximizing the probability that 
at least 50% (or some given threshold) of the voters vote ’Yes’, in expectation over states of 
nature. They characterize optimal signaling schemes analytically, though stop short of prescribing 
an algorithm for signaling. Theirs is the natural objective when the election employs a majority 

®Such relaxations seem necessary for our results. Moreover, depending on the context, modes of intervention for 
shifting the votes of voters who are close to being indifferent may be realistic. 
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(or threshold) voting rule, and the principal is interested in influencing the outcome of the vote. 
Approximating this objective requires nontrivial modifications to our framework, and therefore we 
begin this section by examining a different, yet also natural, objective: the expected number of 
’Yes’ votes. We design a bi-criteria approximation scheme for this objective, then describe the 
necessary modifications for the threshold function objective of |AC14] . 


6.1 Maximizing Expected Number of Votes 

We now examine bi-criteria approximation algorithms for maximizing the expected number of 
‘Yes’ votes. For our benchmark, we use the function ^(™te-sum) ^ 0]) where 

I\£] denotes the indicator function for event £. Assuming voters vote ‘Yes’ precisely when their 
posterior expected utility for a ‘Yes’ outcome is nonnegative, the number of ‘Yes’ votes when voters 
have preferences U E [—1,1]”^”^ and posterior belief /r E equals When the 

state of nature is distributed according to a common prior A, and voters are informed according 
to signaling (p inducing a decomposition {/io-, expected number of ‘Yes’ votes 

equals We use ([/, A) to denote the 

maximum value of signaling schemes ip. 

As the first step to apply our framework, we prove that ^(™te-sum) jg gi^^ble. 

Lemma 6.1. The function is l-stable. 

Proof. For each voter z E [n], let gi : [—1,1]"" ^ {0,1} be the function indicating whether voter i 
prefers the ‘Yes’ outcome, i.e., gi{t) = I[ti > 0]. Each individual gi is l-stable, because as long as the 

n 

i’th input ti is not corrupted the output of gi does not change. Therefore gr(™te-sum) 

i=l 

being a convex combination of l-stable functions, is l-stable by Proposition 12.31 ■ 

Unfortunately, jg not 0(1)-Lipschitz. We therefore employ the bi-criteria extension 

to our framework from Definition 12.71 Specifically, for a parameter <5 > 0, we assume a voter 
votes ‘Yes’ as long as his expected utility from a ‘Yes’ outcome is at least —5. Correspondingly, 
we define the relaxed function (f) := X]ie[n] expected number of ‘Yes’ 

votes from a signaling scheme p = {pa-, can analogously be written as jj^ .= 

E (vote-sum) / 7-T \ 

It is easy to verify that jg (4,0)-relaxation of ^(™te-sum) . combining this fact with 

Theorem 14.31 yields a bi-criteria approximation scheme for the problem of maximizing the expected 
number of ‘Yes’ votes. 

Theorem 6.2. Let e,6 > 0 be parameters, let U E [—1,1]’^^’” deseribe the preferenees of n voters 
in m states of nature, and let A E be the prior of states of nature. There is an algorithm 

with runtime poly(m~P ,n) for computing a signaling scheme p such that ^^^\p,U, X) > 

- e. 

Using the same techniques as in Section 13.21 we can extend this result to the case where the 
valuations of voters are drawn from a distribution give either explicitly or by a sampling oracle. 
We omit the details. 
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6.2 Maximizing Probability of a Majority Vote 

We now sketch the necessary modifications when the principal is interested in maximizing the 
probability of a ‘Yes’ outcome, assuming a majority voting rule. We make two relaxations, which 
appear necessary for our framework: we assume a voter votes ‘Yes’ as long as his expected utility 
from a ‘Yes’ outcome is at least —5, and assume that the ‘Yes’ outcome is attained when at least a 
(0.5 — 5) fraction of voters vote ‘Yes’. Our benchmark will be the maximum probability of a ’Yes’ 
outcome in the absence of these two relaxations. 

We define our benchmark using the function ^(™te-thresh) _ jj^(vote-sum)|-^^ ^ 
evaluates to 1 if at least half of its n inputs are nonnegative, and to 0 otherwise. This function is 
not 0(l)-stable, so we work with a more stringent benchmark which is. Specifically, for a parameter 
5 > 0, we use the function g,(™*®“®™°°*h-thresh) jg pgintwise greater than or equal to g(™te-thresh) ^ 

defined as follows: 

(vote-smooth-thresh) f ? (t) - 0.5 + j) if ^(vote-sum) ^ [g 5 _ g 5] 

|^(vote-thresh)(^) Otherwise. 

, (vote-smooth-thresh) v . - t r --i j_ , r 

Observe that applies a continuous piecewise-linear lunction to the output oi 

^(vote-sum) is easy to verify that jg i-stable, and upperbounds g(™te-thresh) _ 

Finally, to measure the quality of our output we define the relaxed function . 

[—I,!]” —{0,1}, which outputs 1 if at least a (0.5 — 6) fraction of its inputs exceed —6, and 

, J. n J-T, • TD c -J-- (vote-thresh) . /jr i 4.- f (vote-smooth-thresh) / , 

outputs U otherwise. By Defanition l2.7l is a (0, Uj-relaxation of (and, 

consequently, also of gr(™te-thresh) ^ _ 

As usual, let _p(™te-thresh) jj^ ^jvote-thresh) denote the functions which evaluate 

the quality of a signaling ip scheme using ^(™te-thresh) thresh) ^ j.ggpgg|^jYg}y_ Moreover, let 

(9PJ'(™te-thresh) yj ()g the maximum value of _p(™te-thresh) jj^ QYer signaling schemes tp. We 
apply Theorem 14.31 to ^(vote-smooth-thresh) ^ ggtting a = eJ, and use the fact that 

^(vote-smooth-thresh) gppgrbounds our true benchmark g,(™te-thresh) ^ conclude the following. 

Theorem 6.3. Let e,S > 0 be parameters, let U G [—1,1]”^™- describe the preferences of n voters 
in m states of nature, and let A G be the prior of states of nature. There is an algorithm with 

runtime poly(m ^ ,n) for computing a signaling scheme ip such that > 

Qprp (vote-thresh) ^ 

6.3 Connection to Maximum Feasible Subsystem of Linear Inequalities 

Turning our attention away from signaling, we note that g((™te-sum) simply counts the number 
of satisfied inequalities in the system Ax Y 0. Mixture selection for ^(™te-sum) therefore the 
problem of maximizing the number of satisfied inequalities over the simplex. Using our framework 
from Section [2l we obtain a bi-criteria PTAS for this problem. Moreover, using Monte-Carlo 
sampling, our bi-criteria PTAS extends to the model in which A is given implicitly; specifically, 
the rows of A correspond to the sample space of a distribution T> over [—1,1]™, and are weighted 
accordingly. In this implicit model, we can think of mixture selection for ^(™te-sum) gg problem 
of finding x G which maximizes the probability that a ■ x > 0 for o ~ P. 

Motivated by systems applications, Daskalakis et al. [DDD"*" 1^ consider a special case of this 
problem termed Fault- Tolerant Distributed Storage. Their problem is equivalent to mixture selection 
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for ^(™te-sum) implicit model, with the additional restriction that P is a product distribution 

over binary vectors with marginal probabilities given explicitly. They present an additive EPTAS 
for this problem in a uni-criteria sense. Our framework relaxes their restrictions on P, at the cost 
of a bi-criteria guarantee and exponential dependence on the error parameters. 

7 Signaling in Bayesian Normal Form Games 

We consider normal form games of incomplete information, in which payoffs are parameterized by 
a state of nature 6. A principal has access to the exact realization of 6, whereas the players initially 
share a prior belief on 9 and form a posterior belief based on the information revealed by the 
principal. The goal of the principal is then to commit to revealing certain information about 9 — 
i.e., a signaling scheme — to induce a favorable equilibrium over the resulting Bayesian subgames. 

Signaling in normal form games has recently been examined from a complexity-theoretic per¬ 
spective. Dughmi |Dugl4| considered the special case of two-player zero-sum games, and examined 
the design of symmetric signaling schemes with the goal of maximizing the expected utility of one of 
the players. It was shown that no FPTAS is possible for the signaling problem for zero sum games, 
assuming the planted clique conjecture. In this section, we complement the impossibility result 
of |Dugl4| with a bi-criteria quasi-polynomial time approximation scheme (QPTAS) which applies 
to normal form games with a constant number of players, slightly relaxing both the equilibrium 
definition and the polynomial-time restriction. It remains open if signaling for Bayesian zero-sum 
games admits a PTAS. 

7.1 Background and Notation 

We make heavy use of tensors in describing multi-player games. Specifically, we focus on n- 
dimensional tensors of order k with entries in [—1,1], where n is typically the number of strategies 
per player and k is the number of players. We think of these tensors as functions T : [n]^ —>■ [—1,1] 
mapping a strategy profile s G [n]^ to a number in [—1,1]. Such a tensor T also naturally describes 
a multilinear map; overloading notation, given k vectors x\,... ,Xk G M”' we write T(xi..., Xh) = 

X]si,...,sj,G[n] (^('^1) ■ ■ ■ ■> ■ ni=l Xi(si)'^. This is most natural when xi,..., G form a mixed 
strategy profile, in which case T(xi,... ,Xk) evaluates the expected value of T over pure strategy 
profiles drawn from (xi,..., x*,). 

A Bayesian normal form game is defined by the following parameters: 

- An integer k denoting the number of players, indexed by the set [k] = {1,..., A;}. 

- An integer n bounding the number of pure strategies of each player. Without loss of gen¬ 
erality, we assume each player has exactly n pure strategies, and index them by the set 
[n] = {!,... ,n}. 

- An integer m denoting the number of states of nature. We index states of nature by the set 
0 = {1,..., m}, and use the variable 9 to represent a state of nature. 

- A common prior distribution A G A^, on states of nature. 

- A family of payoff tensors Af : [n]^ —>• [—1,1], one per player i and state of nature 9, where 
A^{si,..., Sk) is the payoff to player i when the state of nature is 9 and each player j plays 
strategy Sj. 


19 






Note that a game of complete information is the special case with m = 1 — i.e., the state of 
nature is fixed and known to all. In a general Bayesian normal form game, absent any information 
about the state of nature beyond the prior A, risk neutral players will behave as in the complete 
information game We consider signaling schemes which partially and symmetrically 

inform players by publicly announcing a signal o", correlated with 0; this induces a common posterior 
belief on the state of nature for each value of a. When players’ posterior belief over 6 is given by 
fi G Am, we use to denote the equivalent complete information game 'Ei 0 ^^[A^]. As shorthand, 
we use A^{xi,... ,Xk) to denote E[Alf(si,... ,Sk)] when 6 ^ fj, ^ Am and Si ^ Xi ^ A„. In the 
event that the state of nature is 6 and players play the pure strategy profile si,..., we refer to 
the tuple { 6 ,si,..., Sk) as the state of play. For our result, we assume that a Bayesian game (Al, A) 
is represented explicitly as a vector A G and a list of tensors {Af G [—1,1]” : i £ [k],9 £ [m]}. 

We adopt the approximate Nash equilibrium as our equilibrium concept. There are two variants. 

Definition 7.1. Let e > 0. In a k-player n-action normal form game with expected payoffs in [—1,1] 
given by tensors Ai,... ,Ak, a mixed strategy profile xi,... ,Xk £ An is an e-Nash Equilibrium (e- 
NE) ^/ 

Ai{xi,... ,Xk) > Ai{ti,x-i) - e 
for every player i and alternative pure strategy f £ [n]. 

Definition 7.2. Let e > 0. In a k-player n-action normal form game with expected payoffs in 
[— 1 , 1 ] given by tensors Ai,... ,Ak, a mixed strategy profile xi,... ,Xk £ An is an e-well-supported 
Nash equilibrium (e-WSNE) if 

Ai{si,x-i) > Ai{ti,x-i) - e 

for every player i, strategy Si in the support of Xi, and alternative pure strategy f £ [n]. 

Clearly, every e-WSNE is also an e-NE. When e = 0, both correspond to the exact Nash 
Equilibrium. Note that we omitted reference to the state of nature in the above definitions — in a 
subgame corresponding to posterior beliefs fj, £ Am, we naturally use tensors A^,... A^ instead. 

Fixing an equilibrium concept (NE, e-NE, or e-WSNE), a Bayesian game (Al, A), and a signaling 
scheme (^ : 0 —)■ S, an equilibrium selection rule distinguishes an equilibrium strategy profile 
(xf,... ,xf.) to be played in each subgame a — we call the tuple X = {xf : a £ Ti,i £ [/c]} a 
Bayesian equilibrium of the game (Al, A) with signaling scheme ip. Together with the prior A, the 
Bayesian equilibrium X induces a distribution T G A0x[„]fc over states of play — we refer to T 
as a distribution of play. This is analogous to implementation of allocation rules in traditional 
mechanism design. 

Our results concern objectives which depend only on the state of play, and we seek to maximize 
the objective in expectation over the distribution of play. These include, but are not restricted to, 
the social welfare of the players, as well as weighted combinations of player utilities. Formally, our 
objective is described by a family of tensors : [n]^ —)■ [—1,1], one for each state of nature 0 G 0. 
Equivalently, we may think of the objective as describing the payoffs of an additional player in the 
game — namely the principal. For a distribution p, over states of nature, we use F^ 

to denote the principal’s expected utility in a subgame with posterior beliefs p, as a function of 
players’ strategies. 

For a signaling scheme p and associated (approximate) equilibria X = {xf : ct G G [fc]}, our 
objective function can be written as F{p,X) = E^...,j,<t[T'( 6 *, ^]. When p corresponds 

to a convex decomposition {{pa, z^( 7 )}o-gs prior distribution, this can be equivalently written 
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as F{ip,X) = Xltres OPT = OPT{A, X,P) denote the maximizer of F{(p*,X*) 

over signaling schemes tp* and (exact) Nash equilibria X*. We seek a signaling scheme y? : 0 —)■ S, 
as well as a Bayesian e-NE (or e-WSNE) X such that F{ip,X) > OPT — e. 

We will use the following Lemma, which follows easily from the results of Lipton et ah [LMMO.S] . 
to restrict attention to equilibria with small support. 

Lemma 7.3. Let tensors Ai,... ,Ak ■ [n]^ —>■ [—1,1] describe a k-player game of complete informa¬ 
tion with n pure strategies per player, and let X : [n]^ —>• [—1,1] be a tensor describing an objective 
function on mixed strategies. Define the function r(e) = ^(fc+i) in^fc+i) n) ^ e > 0, integer 

s > r{e), and mixed strategy profile x = (xi,..., Xk), there is a profile x = (xi,..., Xk) of s-uniform 
mixed strategies such that |.4.i(x) — Ai{x)\ < e for all players i, |X(x) — P{x)\ < e, and if x is a 
Nash equilibrium of A then x is an e-equilibrium of A. This holds for both NE and WSNE. 

Proof. We can think of the tensor : [n]^ —)• [—1,1] as describing the utility of an additional 
player in the game with a trivial strategy set. The rest follows from [LMMO.S] Theorem 2]. ■ 

7.2 QPTAS for Signaling in Normal Form Games 

We prove the following bi-criteria result. 

Theorem 7.4. Let e > 0 denote an approximation parameter, let (A, A) be a Bayesian normal form 
game with k = 0(1) players, n actions, and m states of nature, and let P : [m] x [n]^ —)■ [—1,1] 

ln(n/e) Inn 

be an objective function given as a tensor. There is an algorithm with runtime poly(m ? ,n~F~) 

which outputs a signaling scheme p> and corresponding Bayesian e-equilibria X satisfying F{ip, X) > 
OPT{A, \,P) — e. This holds for both approximate NE and approximate WSNE. 

In other words, when the number of players is a constant we can in quasi-polynomial time 
approximate the optimal reward from signaling while losing an additive e in the objective as well 
as in the incentive constraints, as compared to the optimal signaling scheme / Nash equilibrium 
combination. 

Eix e > 0. To prove this theorem, we define functions g and g^ which each take as input a k- 
player n-action game of complete information B, given as payoff tensors Bi... ,Bk ■ [n]^ ^ [—1,1], 
and an objective tensor Q : [n]^ —>■ [—1,1], and output a number in [—1, Ij. Specifically, g{B, Q) = 
max{^(x) : x G EQ{B)} and ge{B,Q) = max{^(x) : x G EQ^{B)}, where EQ{B) denotes the set 
of Nash equilibria of the game B, and EQ^{B) denotes the (non-empty) set of [r(e/4)]-uniform 
e-Nash equilibria (or e-WSNE) for r as given in Lemma 17.31 Recall that Q{x) denotes evaluating 
the multilinear map described by tensor Q at the mixed strategy profile x G A^. 

Now suppose we fix a Bayesian game (A, A) and objective tensor F as in the statement of 
Theorem Ym. Eor a subgame with a posterior distribution // G over states of nature, the 
principal’s expected utility at the “best” Nash equilibrium of this subgame can be written as 
g{A^,P^). Similarly, the principal’s expected utility at the “best” [r(e/4)]-uniform e-NE (or 
e-WSNE) can be written as gf:{A^,F^). Observe that the input to both g and g^ is a linear 
function of /r, as need to apply the results in Section HI Eor a signaling scheme p corresponding 
to a decomposition A = ' ho- of the prior distribution A into posterior distributions (see 

Section [4T]l . we can write the principal’s expected utility assuming the “best” Nash equilibrium as 
= X]o-gs ^o' 5 (F^'^ , Al^*^), and assuming the “best” [r(e/4)]-uniform e-equilibrium as F^{ip) = 
Xlo-es ^(rdeiF^'^ ,A^’’). We use OPT to denote the maximum value of F over all signaling schemes. 
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We prove Theorem 17.41 by exhibiting an algorithm for computing a signaling scheme (p such 
that > OPT — e. The proof hinges on two main lemmas. 

Lemma 7.5. The function g is 2{k + l)n^-stable. 

Proof. As noted in Section [2l any function mapping a hypercube [—1,1]^ to the interval [—1,1] is 
2N stable. The function g is such a function with N = [k l)n^. ■ 

Lemma 7.6. The function g^ is an {e/A, e/2)-relaxation of g. 

Proof. Consider tensors G,G ■ [n]^ [“1)1] with |^(s) — ^(s)| < e/4 for all s G [n]^, and two 

fe-player n-action games B = (. 81 ,. -^jBk) and B = {Bi,... ,Bk) with \Bi{s) — < e/4 for all 

s G [n]^. It suffices to show that ge{B,G) > g{B,G) — e/ 2 . Let x G be the Bayesian equilibrium 
of B for which G{x) = g{B,G). By Lemma 17.31 there is a profile x of [r(e/4)]-uniform mixed 
strategies such that x is an e/4-equilibrium of B, and G{x) > G{x) — e/4. Since B differs from B by 
at most e/4 everywhere, it follows that x is an e-equilibrium of B, i.e. x G EQ^(B). Similarly, since 
G differs from G by at most e/4 everywhere, it follows that G{x) > G(x) — e/4 > G{x) — e/ 2 . We 
conclude that g^{B,G) > G{x) > g{B,G) — e/ 2 . ■ 

We now complete the proof of Theorem 17.41 bv instantiating Theorem 14.,41 with g, h = g^, and 
a = ■ The runtime is poly(m P , {k -\- T)n^,T), where T is the time needed to evaluate 

g^ (and compute the corresponding [r(e/4)]-uniform e-equilibrium) on a given input. Recall that 
k = 0(1) and a = . Moreover, using brute-force enumeration of all [r(e/4)]-uniform mixed 

In n 

strategy profiles we conclude that T is bounded by a polynomial in n“?^. Therefore our total 

ln(n/e) In n 

runtime is poly(m P as needed. 

Remarks In the special case of two-player zero-sum games and a principal interested in max¬ 
imizing one player’s utility, as studied in |Dugl4| , our techniques lead to a more efficient ap¬ 
proximation scheme and a uni-criteria guarantee. This is because the principal’s payoff tensor 
G equals the payoff tensor B of one of the players (say, player 1), and consequently the func¬ 
tion g{B,G) = g{B,B) = max^, min^ xTUy is n^-stable and 2-Lipschitz. Its Lipschitz continuity 
follows from the fact that an e-equilibrium of a zero-sum game leads to utilities within e of the 
equilibrium utilities. Moreover, evaluating g now takes time T = poly(m, n). Theorem 14.21 in- 

ln(Ti/e) 

stantiated with a = and 5 = e/4, leads to an algorithm with runtime poly(m P ,n), which 
outputs a signaling scheme p and corresponding Bayesian (exact) Nash-equilibria X satisfying 
F{p, X) > OPT{A, A, F) - e. 

8 Hardness Results 

In this section, we present hardness results which justify our assumptions, and exhibit the limi¬ 
tations of our techniques. Specifically, we show in Sections 18.11 and 18.21 that neither stability nor 
Lipschitz continuity alone suffices for an additive PTAS. In Section 18.31 we show that even in the 
presence of Lipschitz continuity and noise stability, obtaining an additive FPTAS would imply P 
= NP. 
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8.1 NP-Hardness in the Absence of Lipschitz Continuity 

We now show that stability alone does not suffice for an additive PTAS for mixture selection, in 
general. First, we show that mixture selection for the 1-stable function presented in 

Section [6l does not admit a (uni-criteria) additive PTAS unless P = NP. Being that ^(™te-sum) jg 
not continuous in any metric, we drive the point home by exhibiting a “smoothed” function 
which is 1-stable and 0(1)-Lipschitz with respect to but not 0(1)-Lipschitz with respect to 
and show that mixture selection for still does not admit an additive PTAS unless P = 

NP. 

Both NP-hardness results share a similar reduction from the maximum independent set problem. 
We use a consequence of the result by |KS12j . namely that there exists a constant e such that it 
is NP-hard to approximate maximum independent set to within an additive error of en, where n 
denotes the number of vertices. 

Given an n-node undirected graph G, let OPTjs = OPTis(G) be the size of its largest indepen¬ 
dent set. We define the n x n matrix A = A[G) as follows: 

- Diagonal entries of A are all ^ ^ for all 1 < i < n). 

- When vertices i and j share an edge in G, both Ajj and Aj^i are —1. 

- All other entries of A, namely Ajj for non-adjacent distinct vertices i and j, are 
We relate OPTjs to convex combinations of the columns of A as follows. 

Observation 8.1. Let I be an independent set of G with \X\ = k. There exists x G A„ such that 
k entries of Ax are at least and all remaining entries are strictly negative. 

Proof. Let x G A„ be the normalized indicator vector of X — i.e., Xj = ^ if i G X and Xj = 0 
otherwise. By construction (Ax)i = — {k — 1)^) > ^ whenever i G X, and {Ax)i < — ^ 

otherwise. ■ 

Observation 8.2. For any x G A„, nonnegative entries of Ax correspond to an independent set 
of G. Consequently, Ax can have at most OPTjs nonnegative entries. 

Proof. Let t = Ax. Consider an edge {i,j} of graph G, and without loss of generality assume that 
Xj > Xj. If Xj = 0, then L < — ^ < 0 by construction. Othewise, tj < ^ — Xi < 0. Therefore, ti 
and tj cannot be both nonnegative. We conclude that the nonnegative coordinates of t correspond 
to an independent set of G. ■ 

Observations 18.11 and 18.21 imply that max^j^A^ (Ax) = . Combined with the fact 

that obtaining an additive PTAS for the maximum independent set problem is NP-hard, we get 
the following theorem. 

Theorem 8.3. Mixture selection for the 1-stable function admits no additive PTAS 

unless P = NP. 

Noting that ^(™te-sum) jg ^ discontinuous function, for emphasis we exhibit a function ^(®^°p®) 
which is Lipschitz continuous in (but not in L°°) and 1-noise stable, but for which the same 
impossibility result holds by an identical reduction. Informally, “smoothes” the threshold 

behavior of g(™te-sum) ^^g fg^ows: each input L contributes 0 to when ti < 0, contributes 
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i when ti > and the contribution is a linear function of increasing from 0 to ^ for ti G [0, ^]. 
Formally, we define min {4max {0, tj} , Since each entry of t contributes 

at most ^ to (t), it is easy to verify that is 1-stable. Moreover, since the partial 

derivatives of (t) are upper-bounded by 4, it is 4-Lipschitz continuous with respect to the 

metric. Observations IHTTl and imply that maXa;gA„ , ruling out an additive 

PTAS for mixture selection for gr(siope)_ 

Theorem 8.4. The function is 1-stable and 0{1)-Lipschitz with respect to , and yet 

mixture selection for admits no additive PTAS unless P = NP. 

8.2 Planted-Clique Hardness in the Absence of Stability 

We now present evidence that Lipschitz continuity alone does not suffice for a PTAS for mixture 
selection. Recalling that a quasipolynomial time algorithm follows from our framework whenever a 
function is 0(1)-Lipschitz, we reduce from the planted clique problem —for which a quasipolynomial 
time algorithm exists, and yet a polynomial-time algorithm is conjectured not to exist—rather than 
from an NP-hard problem. 

In the planted clique problem, one must distinguish the n-node Erdos-Renyi random graph 
G{n, i), in which each edge is included independently with probability i, from the graph G{n, i, k) 
formed by “planting” a clique in G{n,^) at a randomly (or, equivalently, adversarially) chosen 
set of k nodes. This problem was first considered by by Jerrum |Jer92j and Kucera |Kuc95j . and 
has been the subject of a large body of work since. A quasi-polynomial time algorithm exists 
when k > 2 log re, and the best polynomial-time algorithms only succeed for k = Gl[y/n) (see e.g., 
|AKS98j [DCdPllj [FRin] |(X)in] l. Several papers suggest that the problem is hard for k = o{y/n) 
by ruling out natural classes of algorithmic approaches (e.g. [.Ter ^ rFmlWfP + 1,4] l. The planted 
clique problem has therefore found use as a hardness assumption in a variety of applications (e.g. 
[AAK+OT] . [.TPOnj . [HKllj . [MVn9] . |Dugl4| ). We use the following well-believed conjecture as our 
hardness assumption. 

Assumption 8.5. For some function k = k{n) satisfying k = w(log^re) and k = o{y/n), there is 
no probabilistic polynomial-time algorithm that can distinguish between a random graph drawn from 
G{n,^) and a random graph drawn from G{n,^, k) with success probability 1 — o(l). 

We let k = k{n) be as in Assumption 18.51 and consider mixture selection for the function 
^Ehque) _ where fjj] denotes the Pth largest entry 

of the vector t. It is easy to verify that is 3-Lipschitz with respect to the L°° metric, yet is 

not 0(l)-stable. We prove the following theorem. 

Theorem 8.6. Assumption |g.5l implies that there is no additive PTAS for mixture selection for 

(clique) 

9 k ■ 

To prove Theorem 18.61 we show that max^-g^n {Ax) is arbitrarily close to 1 with high 

probability when A is the adjacency matrix of G ~ G{n, \,k), and is bounded away from 1 with 
high probability when A is the adjacency matrix of G ~ G{n, ^). For convenience, and without 
loss of generality, we assume that both random graphs include each self-loop with probability ^ — 
i.e., diagonal entries of the adjacency matrix A are independent uniform draws from {0,1} in both 
cases. Our argument is captured by the following two lemmas. 
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Lemma 8.7. Fix a constant e > 0. Let G ~ G{n, ^,k), and let A be its adjacency matrix. With 
probability 1 — o(l), there exists an x £ A„ such that > 1 — e. 

Proof. Let C denote the vertices of the planted fc-clique. We set Xj = ^ if z G C and 0 otherwise. 
Let t = Ax. For i £ C, ti > 1 — p On the other hand, all other entries of t concentrate around 
I with high probability. For i ^ C, ti is simply the average of k independent Bernoulli random 
variables by definition of Q{n,^,k)] using Hoeffding’s inequality, we bound the probability that U 
deviates from its expectation by more than a constant 5 > 0, to be chosen later: 


Pr 


1 

ti — - 

> (5 

2 



< 2e 


-25‘^k 


By the union bound, ti £ [^ — 6, ^ + 6] simultaneously for all z ^ C with probability at least 
1 _ 2 iog^-!^(fc) = 1 _ o(l). Thus - tin] < 26 and = t[fc] - it[k+i] - t[n]) > 1 - ^ - 2(5 

with probability 1 — o(l). Choosing 6 = e/3, we conclude that > 1 — e with probability 

l-o(l). ■ 

Lemma 8 . 8 . Fix a constant e > 0. Let G ~ G{n,^), and let A be its adjacency matrix. With 
probability 1 — o(l), g^^^''^'^^\Ax) < | + e for all x £ A„. 

Proof. Recall that is 0(1)-Lipschitz and — like any other function from the hypercube 

to the bounded interval — 0(rz)-stable. If there exists x* such that > | + e, then 

Theorem 12.51 implies that there is an integer s = O(logn) and an s-uniform vector x such that 
^^ciique) There are n® such vectors. We next show that for an arbitrary fixed vector 

X £ An the probability that g^^^^'^'^^\Ax) > | is at most This would complete the proof by 

the union bound, since 1 — = 1 — o(l). 

Fix X G An, and let t = Ax. Define V as the distribution supported on [0,1] which is sampled 
as follows: draw a uniformly from {0,1}”, and output a ■ x. Since A is the adjacency matrix of 
G ~ G{n, ^), each entry ti of t can be viewed as an independent draw from F. We exploit a key 
property of F in our proof, namely the fact that F is symmetric about p Formally we mean that 
Prx>[z’] = Prx)[l — r] for all r £ [0,1], and this follows easily from the definition of F. 

Symmetry of F implies that Pvrr^vlr > ^] = > p Recalling that k = o(n) and 

that entries of t are independent draws from F, the Chernoff bound implies that the following holds 
with probability at least 1 — 

t[n] < < t[k+l]- ( 8 . 1 ) 

If > |, then the following two conditions must hold: 

1 . t[fc] > |, and 

2- i[fc+i] - tin] < 5- 

Condition [1] implies that the k largest entries of t are all at least |. Furthermore, unless Inequal¬ 
ity (18.ip is violated — an event with small probability 2 “^^”) — Condition [5] implies that remaining 
entries of t are all strictly between ^ and |. Let p denote Pir^vV < 5 ]) also equal to > |] 

by symmetry of F. The probability that k entries of t are at least | and all remaining entries 
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are in (|, |) is given by — 2p)^ which is maximized at p = with maximum value 

2 -n(fc)_ summary, the probability that (Ax) > | is at most = 2“^^^), as 

needed. ■ 


8.3 NP-hardness of an additive FPTAS 

Our last hardness proof rules out an additive FPTAS for the lottery design problem, a.k.a. mixture 
selection for the function ^ as defined in Section [3l We restrict attention to the weight vector 

w assigning equal weight to all inputs to and therefore omit references to the weight vector 

for the remainder of this section. 

Theorem 8.9. The lottery design problem, a.k.a. mixture selection for , admits no additive 

FPTAS unless P = NP. 

Proof. The proof involves a reduction from the independent set problem which is very similar to the 
reduction in Section [8.11 so we only detail the necessary modifications. Given an n-node undirected 
graph G, we define an n x n matrix A = A{G) as in Section [8.11 though shifted and normalized so 
entries lie in [0,1]. Specifically, we set diagonal entries of A to |, and we set an off-diagonal entry 
Aij to 0 if i and j share an edge and to ^ ^ otherwise. Observation 18.21 implies that for every 

X G A„, entries of Ax which are at least ^ correspond to an independent set of G. Observation 
18.11 implies that there is a vector x* G A„ so that Ax* has exactly OPTis(G) entries no less than 
1 + A 

n ■ 2^ Sn- 

Our input to the lottery design problem will be a valuation matrix B with 8n^ -|- n rows and n 
columns, obtained from A by adding 8n^ “dummy” rows, each of which is (p*,... ,p*). Setting a 
price of p* for the lottery x* results in expected revenue r* := p* ■ = (^ + ^)' > 

Therefore, r* lower-bounds max 2 ,gAn We claim that r* is in fact the maximum 

value of this mixture selection problem, and prove this by fixing an arbitrary lottery x G A^, and 
conducting a simple case analysis on the associated price p\ 


- When p* < p < 1, none of the “dummy” types purchase the lottery x, resulting in expected 


revenue at most 


SnT+r 


< i <r* 


When ^ < p < p*, all dummy types purchase x, and the i’th non-dummy type purchases x 
only if (Ax)i > Since entries of Ax which are no less than ^ correspond to an independent 
set of G, at most 8n^ -|- OPTis types purchase the lottery x, resulting in expected revenue at 
“ost p • < r*. 

When 0 < p < the best case scenario is that all buyer types purchase the lottery at price 
p, yielding expected revenue at most p < ^ < r*. 


Recalling that there exists a constant e > 0 for which the maximum independent set problem 
does not admit an additive en-approximation algorithm in polynomial time, we conclude that 
mixture selection for does not admit a polynomial-time additive e'-approximation algorithm 

for any e' = e'(n) < (^ -k 


Sn ' 8n+l 


This rules out an additive FPTAS. 
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9 Connection to Boolean Function Analysis 

In Definition [221 stability of a function from the solid hypercube to the bounded interval was defined 
as robustness to corruption patterns which follow a light distribution. In this section, we exhibit a 
closely-related algebraic notion of stability, based on Fourier analysis of Boolean functions. Instead 
of using light distributions directly, we describe the effects of corruption on a function g at input t in 
the solid hypercube by a Boolean function ht- In particular, ht takes as input a vector 2 ; E {—1, !}"■, 
interprets z* = 1 as forbidding corruption of input and Zj = — 1 as permitting corruption of ti, 
and considers the worst-case such corruption of t. Formally, denoting X'*“(z) := {i E [n] : Zj = 1} 
and 1~{z) := {i E [n] : Zi = —1}, we define Boolean extensions of g at an input t. 

Definition 9.1 (Boolean extension). Let g : [—1,1]"' —>• [—1,1], and let t E [—1,1]". A Boolean 
function ht : {—1,1}" ^ [—1,1] is called a Boolean extension of g at t if it satisfies: 

ht{'i-)=9{t) (9.1) 

hAz) < mm{g{t') : t' « t}. (9-2) 

x-{z) 

Next we define Algebraic Stability for g{t)^ using the notion of the Fourier transform of a 
Boolean function (e.g. [O’DUj i. 

Definition 9.2 (Algebraic Stability). A function g{t) : [—1,1]" —>■ [—1,1] is algebraically k-stable 
if for every t E [—1,1]" there exists a Boolean extension hfiz) at t such that: 

ht{S) > 0 for all S C [n] (9-3) 

ht{S) = 0 for all S such that jiSj > k, (9-4) 

where hfiS) is the Fourier coefficient of ht at S F [n]. 

In the parlance of Boolean function analysis, g is algebraically stable if for all t, the Fourier 
spectrum of ht is both nonnegative and low-degree. We prove the following analogue of Theorem l2.5l 

Theorem 9.3. Let g : [—1,1]" —>■ [—1,1] be algebraically k-stable and c-Lipschitz in L°°, let A be 
an n X m matrix with entries in [—1,1], let <5, e > 0, and let s > -^ ■ log ^ be an integer. 

Fix a vector x E A^, and let the random variable x denote the empirical distribution of s i.i.d. 
samples from probability distribution x. The following then holds: 

'Efig{Ax)] > g{Ax) — e — c5, (9.5) 

X 

Proof sketch. Let t = Ax. Consider the random variable t = Ax where x is the (random) empirical 
distribution. Let Et denote the event that \ti — ti\ < S for some 6 E (0,1), i.e., that coordinate ti 
is approximately preserved. Dehne a Boolean function px) : { — 1,1}" —t [0,1], 

PT>{z) := ^A{Fitzx+{z)Ei) n (nigi-(^).Ei)]. (9.6) 

In particular, pv{z) is the probability that only coordinates in X+(z) are approximately preserved. 
It is easy to see that Az(^{-i,i}^P'dA = 1 and Pr[£'i] = Pr[lti - t] < <5] = 'A,z-.zi=iP'D{E)- 
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Next, we state some standard equalities for Boolean functions. Let 1 denote the all-one n- 
dimensional vector (1,1,... , 1) and ht be a Boolean extension of gt as stated in Definition 19.21 

^ ht{z) ■ pv{z) = 2^ fB{S) ■ ht{S) (9.7) 

SC[n] 

ht{S) = ht{l) (9.8) 

SC[n] 

By Definition 19.11 ht(l) = g{t). Since g{t) is c-Lipschitz in we know: 

'&[g{Ax)\ = E[g{t)] > V ht{z) ■ pv{z) - cd, (9.9) 

so it suffices to prove ht{S) — 2”' Ylscln]I^(^) ' ^tiS) < e, which can be rewritten as: 

il-2^fBiS))ht{S)+ (9.10) 

S:|S|<fc S:|S|>fc 

As the latter term in (I9.10h is zero by definition 19.21 we only need to upper bound X]s-|5|<A:(^ “ 

‘^^Pv{S))ht{S). Using a simple union-bound argument, one can prove that with sample size s > 
^ log ^ for all S C [n] with lAI < k. As ht{S) > 0 for 5 with |5| < k, we 

have X] 5 -|s|<fc(l “ ‘2A^{S))ht{S) < Combined with equation ()9.9p we have 

E[g{Ax)] > g{t) -e-c6. ■ 


9.1 Algebraic Stability for our Applications 

We observe that algebraic stability holds for the objective functions in some our applications, 
and leave open whether it holds for all. Specifically, the relevant functions in lottery design, 
revenue maximizing signaling, and one of our two voting problems, are algebraically 0(l)-stable, 
and therefore an additive PTAS for each of those applications follows from Theorem 19.31 iust as it 
did from Theorem 12.51 We list the Boolean extension associated with each of these applications. 


Lemma 9.4 (Lottery design). For a fixed price p, a Boolean extension of the objective function 

gw°p'''^^\t) := p ■ ■ I[ti > p]) at t is = p ■ X]iG[n] > p]- We enumerate 

over all prices p, up to a suitable discretization, in the algorithmic solution. 

Lemma 9.5 (Persuasion in voting). A Boolean extension of at t is = 

Zj + l 

2-^i£[n] n 2 ’ 

Lemma 9.6 (Revenue maximization). A Boolean extension of the function max2 : [0,1]"' —[0,1] 
at t is h['^°'^^\z) = , where i and j denotes the indices of the largest and second-largest 

entry oft, respectively. 
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